Dynamics of strategy distribution in iterated games

Authors

  • Stéphane Airiau
  • Sabyasachi Saha
  • Sandip Sen
Abstract

Evolutionary tournaments have been used as a tool for comparing game-playing strategies. For instance, in the late 1970s, Axelrod organized tournaments to compare strategies for playing the iterated prisoner’s dilemma (PD) game. While these tournaments and later research have given us a better understanding of successful strategies for iterated PD, we understand less about strategies for playing iterated versions of arbitrary single-stage games. While solution concepts like Nash equilibria have been proposed for general-sum games, learning strategies like fictitious play may be preferred for playing against sub-rational players. In this paper, we discuss the relative performance of both learning and non-learning strategies in different population distributions, including those that are likely in real life. The testbed used to evaluate the strategies includes all possible structurally distinct 2×2 conflicted games with ordinal payoffs. Plugging head-to-head performance data into an analytical finite-population evolution model allows us to evaluate the evolutionary dynamics of different initial strategy distributions. Two key observations are that (a) the popular Nash strategy is ineffective in most tournament settings, and (b) simple strategies like best response benefit from the presence of learning strategies. We often observe convergence to a mixture of strategies rather than to a single dominant strategy, and we explain such mixed convergence using head-to-head performance results.
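The idea of feeding head-to-head results into an evolution model can be illustrated with a minimal sketch. The abstract does not specify the paper's finite-population model, so this example swaps in a plain discrete replicator update over population shares, which is an assumption and not the authors' method. The function name `evolve` and the 2×2 payoff table are hypothetical; the numbers are invented for illustration and are not results from the paper.

```python
def evolve(shares, payoff, generations=200):
    """Discrete replicator update (an assumed stand-in model).

    shares[i]    -- fraction of the population using strategy i
    payoff[i][j] -- average head-to-head payoff of strategy i against j
    Each generation, a strategy's share is scaled by its expected payoff
    against the current population, relative to the population mean.
    """
    shares = list(shares)
    n = len(shares)
    for _ in range(generations):
        fitness = [
            sum(payoff[i][j] * shares[j] for j in range(n)) for i in range(n)
        ]
        mean = sum(s * f for s, f in zip(shares, fitness))
        if mean <= 0:
            break  # the update is only defined for a positive mean payoff
        shares = [s * f / mean for s, f in zip(shares, fitness)]
    return shares


# Hypothetical head-to-head table: strategy 0 does well against strategy 1
# but poorly against itself, so neither strategy can take over alone.
illustrative_payoff = [
    [1.0, 3.0],
    [2.0, 2.0],
]

if __name__ == "__main__":
    print(evolve([0.9, 0.1], illustrative_payoff))
```

With a table of this shape, the shares settle at an interior mixture rather than at a single strategy, which is the kind of mixed convergence the abstract describes. A finite-population model would add sampling effects that this sketch ignores.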

Publication date: 2004